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Abstract 



We describe the processes obtained by time reversal of a class of stationary jump-diffusion 
processes that model the dynamics of genetic variation in populations subject to repeated 
bottlenecks. Assuming that only one lineage survives each bottleneck, the forward process is 
a diffusion on [0, f ] that jumps to the boundary before diffusing back into the interior. We 
show that the behavior of the time-reversed process depends on whether the boundaries are 
accessible to the diffusive motion of the forward process. If a boundary point is inaccessible 
to the forward diffusion, then time reversal leads to a jump-diffusion that jumps immediately 
into the interior whenever it arrives at that point. If, instead, a boundary point is accessible, 
then the jumps off of that point are governed by a weighted local time of the time-reversed 
process. 

1 Introduction 

Kingman's observation that the genealogy of a random sample of individuals from a panmictic, 
neutrally-evolving population can be represented as a Markov process jTHl [T7] ranks as one of 
the most influential contributions of mathematical population genetics. Not only has the coa- 
lescent led to a deeper understanding of evolution in neutral populations, but it also plays a 
central role in statistical genetics where it facilitates the efficient simulation of sample genealogies. 
Unfortunately, the Markov property that makes Kingman's coalescent both mathematically and 
computationally tractable is usually not shared by genealogical processes in populations composed 
of non-exchangeable individuals. In particular, this is true when there are fitness differences be- 
tween individuals, since then the selective interactions between individuals cause genealogies to 
depend on the history of lineages that are non-ancestral to the sample. The key to overcoming 
this difficulty is to extend the genealogy to a higher-dimensional process that does satisfy the 
Markov property. This has been done in two ways. One approach is to embed the genealogical 
tree within a graphical process called the ancestral selection graph [18, 24, 6 in which lineages can 
both branch and coalesce. The intuition behind this construction is that the effects of selection 
on the genealogy can be accounted for by keeping track of a pool of potential ancestors which 
includes lineages that have failed to persist due to being out-competed by individuals of higher 
fitness. 

An alternative approach was proposed by Kaplan et al. (1988) [T2], who showed that the ge- 
nealogical history of a sample of genes under selection can be represented as a structured coalescent 
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process. Here we think of the population as being divided into several panmictic subpopulations 
(called genetic backgrounds) which consist of individuals that share the same genotype at the 
selected locus. Because individuals with the same genotype are exchangeable (i.e., they have the 
same fitness), the rate of coalescence within a background depends only on the size of the back- 
ground and the number of ancestral lineages sharing that genotype. Thus, to obtain a Markov 
process, we need to keep track of two kinds of information: (i) the types of the ancestral lineages, 
and (ii) the frequencies of the alleles segregating at the selected locus, followed backwards in time. 
For many applications it is assumed that the population is at equilibrium and that the forwards 
in time dynamics of the allele frequencies are described by a stationary diffusion process. In this 
case, the ancestral process of allele frequencies can be identified by time reversal of the diffusion 
process. In particular, if the diffusion process is one-dimensional, then the time-reversed process 
conveniently has the same law as the forward process. A formal derivation of the structured co- 
alescent process for such an equilibrium population is given in [5] and various applications are 
discussed in [TJ 03 [3D] . 

The focus of this article is on the time reversal of a population genetical model that incorporates 
mutation, selection, genetic drift and population bottlenecks. To be concrete, consider a locus with 
two alleles, Aq and Ai, and let p N (t) denote the frequency of A\ at time t in a population of size N. 
In the absence of bottlenecks, we will suppose that the jump process p N '(•) can be approximated 
by the Wright-Fisher diffusion p(-) with generator 

M{p) = \p{± - p)<t>"(p) + (Mo(l - p) ~ tup + «(p)p(i - pM(p) 

1 „ , (1) 

= ^v(p)(t)" {p) + {i(p)(j)' (p), 

where [io and [i\ are the scaled mutation rates from Aq to A\ and from A\ to Aq, respectively, 
and s(p) is the scaled and possibly frequency-dependent selection coefficient of Ai relative to Aq. 
In using the diffusion approximation, we assume that N is large, that time is measured in units 
of N generations, and the unsealed mutation rates and selection coefficient are of order A -1 . 
Convergence results justifying the passage to the diffusion limit can be found in [7J. 

Population bottlenecks are transient events during which most of the population is descended 
from a small number of individuals. On the diffusive time scale, these can be modeled as instanta- 
neous jumps in the allele frequencies, and in this article we will be concerned with a class of models 
in which the bottlenecks always result in the temporary fixation of one of the two alleles, i.e., p(-) 
always jumps to or 1. We have two scenarios in mind. In the first, we consider a locus that is 
part of a non-recombining segment of DNA (e.g., a mammalian mitochondrial genome) subject to 
strong selective sweeps which occur at rate A. During each sweep, a unique copy of a favorable 
mutation arises at some linked site and rises rapidly to fixation. Depending on whether the new, 
strongly-selected mutation occurs on a chromosome carrying an A\ or Ai allele, the frequency of 
A\ will either increase from p to 1 with probability p or decrease from p to with probability 1—p. 
Here we imagine that the selective advantage of the favored mutation is so strong that this change 
can be treated as a jump. The pseudohitchhiking model introduced by Gillespie [TO] belongs to 
this class, as does a related, more general model studied by Kim [T5] . 

The second scenario concerns demographic bottlenecks that occur during transmission of par- 
asites from infected to uninfected hosts. Here we will let p denote the frequency of Ai in a 
chronological series of infected hosts linked by a transmission chain, and we will assume that p(-) 
can be modeled by a diffusion process from the time when one of these hosts is first infected to 
the time when that host first transmits the infection to the next host in the transmission chain. 
Suppose that transmissions occur at rate A, and that each new infection is founded by a single 
parasite, as has been proposed for HIV-1 ; 3l] and for some bacterial pathogens [29]. In this case, p 
will jump to or 1 following each transmission depending on the type of the transmitted parasite. 
Also, to allow for the possibility that transmission itself might be selective (e.g., [57]), we will let 
w(p) denote the probability that the transmitted parasite is of type Ai given that the frequency 
of this allele in the transmitting host is p. In general, we stipulate that w(0) = 0, w(l) — 1, and 
that w(p) is monotonically increasing. If transmission is unbiased, then w(p) — p, as in the pseu- 
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dohitchhiking model. A particular case of this transmission chain model was studied by Rouzine 
and Coffin [35J to understand the effects of selection and transmission bottlenecks on antigenic 
variation in HIV-1. 

Both of these scenarios can be modeled by a jump-diffusion process with infinitesimal generator 

G<t>ip) =2^( 1 ~ P) < t>"{p) + (Mo(l -p)~ MiP + s(p)p{l - p))4>'{p)+ ^ 
Xw(p) (0(1) - 4>{p)) + A(l - w(p)) (0(0) - 4>{pj) , 

where for technical reasons we will assume that s{p) and w(p) are smooth functions on [0, 1], and 
that both mutation rates, fio and fj,%, are positive. Under these conditions, it can be shown (cf. 
Lemma 3.1) that the process p(-) has a unique stationary distribution, 7r(p)dp, which has a density 
on [0, 1]. To characterize the structured coalescent process corresponding to this model, we need 
to identify the stationary time reversal of the process p(-). Formally, this can be done by solving 
the following adjoint problem for the operator G: 

ip(p)G<j)(p)Tr(p)dp = / cj)(p)Gil)(p)n(p)dp, (3) 
o J o 

where <p is in the domain of G. If G generates a Markov process p{-), then this process will have 
the same law as the stationary time reversal of p(-) [23]- When A = 0, p(-) is a diffusion process 
and a simple calculation using integration-by-parts shows that G = G, demonstrating that the 
law of the diffusion is invariant under time-reversal, as remarked above. However, if A > 0, then 
for the adjoint condition Q to be satisfied for all </> € C 2 (R) nC[0, 1], we must instead set 

G^(p) = ~p(l - p)<(p) + £W(P), (4) 



where 



A(p) = (Xi - pX(p) + (i - 2 P - Kp)Mp)) (5) 



7T (p) 

and V € C 2 (R) D C[0, 1] satisfies 

m= r\(p)(^^-)dp and m = /% ( p)f (i ~r (p)Mp) )^ 



1 - K 



with k = w(p)ir(p)dp. Although it is not immediately clear that the operator defined by Q 
is the generator of a Markov process, this calculation does show that the process incorporating 
bottlenecks is not invariant under time reversal. 

To gain some insight into the qualitative behavior of the time-reversed process, it is useful 
to consider two heuristic descriptions. We begin by observing that the behavior of p(-) depends 
strongly on whether the boundary points {0, 1} are accessible or inaccessible to the diffusive motion 
of the forward process. Recall that for the Wright-Fisher diffusion corresponding to A (which we 
call the diffusive motion of the jump-diffusion process), Feller's boundary classification conditions 
show that (resp. I) is accessible if and only if u < 1/2 (resp. u x < 1/2), see e.g. Section 4.7 
in [5J. The importance of this distinction is illustrated in Figure 1, which shows sample paths of 
the jump-diffusion process corresponding to cases where the two boundaries are either inaccessible 
(A) or accessible (B) to the forward diffusion. To see what this suggests about the behavior of 
the time-reversed process, begin at the top of each figure and follow the sample path backwards 
in time towards the bottom. If the boundaries are inaccessible, then whenever the sample path is 
followed back to a boundary at some time, the forward process will necessarily have reached that 
boundary via a jump. Consequently, the time-reversed process must immediately jump into the 
interval (0, 1) whenever it arrives at a boundary that is inaccessible to the forward diffusion. The 
behavior of the time-reversed process at a boundary that is accessible to the forward diffusion is 
very different. In this case, when the sample path of the time-reversed jump diffusion hits that 
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boundary, the forward process may have arrived there either diffusively or via a jump from the 
interior (Figure IB). Accordingly, the time-reversed process need not immediately jump into the 
interior (0, f ) when it visits the boundary, although jumps can only occur when the process is on 
the boundary and are certain to occur at some such times if A > 0. 

A: Inaccessible Boundaries B: Accessible Boundaries 




Figure 1: Sample paths of the jump-diffusion process Q with either inaccessible (A) or accessible 
boundaries (B) . The forward diffusion is a neutral Wright-Fisher process with symmetric mutation: 
Mo = Mi = 1 m A and 0.2 in B. 

A more quantitative picture of this second case can be obtained by considering a less singular 
process that approximates the jump-diffusion process corresponding to For e e (0,1/2), 

let p e (-) = (p e (t) ■t>0) be a perturbation of a Wright-Fisher diffusion which at rate A jumps 
to a point chosen uniformly at random from an interval of width e adjacent to one of the two 
boundaries. More precisely, let p e (-) be the Markov process with generator 

G e <j>(p) = ^p(l -p)4>"(p) + (mq(1 - p) - tMip + s(p)p(l-p))<t>'(p) + 

A W>(?) - m)dq + (1 - w{p))- e j\<j>{q) ~ m)dq\ • 

Writing ir e (p) for the density of the stationary distribution of this process, a simple calculation 
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using ([3]) shows that the stationary time reversal of p e (-), denoted Pe(-), is also a jump diffusion 
process with generator 

GMp) = ip(l-p)^ ,/ (p) + ^(p(l-p)<(p) + (l-2p- A1 ( P ))7r e (p)y(p) + 

2 7T e (p) 

^(^ 1 <--.^»)i 1 (^) W( ' ) -* )) * + 

where ^ € C 2 ([0, 1]) and k £ = w(p)-K e (p)dp. It is easy to read off the behavior of this process 
from its generator. In particular, we see that p e (-) can only jump when it is present in the region 
[0, e) U (I — e, 1] and that the rate at which jumps occur out of this region is equal to \n t / {eir e {p)) 
when p£ (1 — e, I] and A(l — K £ )/(e7r e (p)) when p £ [0, e). 

To relate these observations to the process p(-), let T > and notice that as e tends to 0, 
the sequence of processes (p e (j) converges in distribution on D[ 0jl ]([0, T]) to p(-). Furthermore, 
because time reversal is a continuous mapping on D[ ,i]([0j T]), the continuous mapping theorem 
[7] implies that the sequence of processes (p t (-)) converges in distribution to the process p(-). In 
particular, this suggests that p(-) has the following behavior. For each e £ (0,1/2), define the 
additive functionals 

and suppose that for i = 0, 1, the limits Li(t) = lim e _>o Li e (t) exist for all t > 0. Here we would 
like to interpret Li(t) as the local time of the process p(-) at i £ {0, 1}. Then, by comparison 
with the jump-diffusion processes p e (-), we expect that if both boundaries are accessible, then 
p(-) is a jump-diffusion with diffusive motion in (0, 1) governed by Q which jumps from the 
boundary point to a random point in the interval (0, 1) distributed as ^w(q)Tr(q) dq as soon 
as Lq(-) exceeds an exponential random variable with parameter Xk and which jumps from the 
boundary point I to a random point distributed as (l — w(q))ir(q) dq on (0, 1) as soon as 
exceeds an exponential random variable with parameter A(I — k). Although these remarks are 
purely heuristic, we show below that they correctly describe the stationary time reversal of the 
pseudo- hitchhiking model and other jump-diffusions with generators of the form (2). 

2 Main result 

Although our principle concern is with the modified Wright-Fisher process corresponding to ([2]), 
we state our results for a more general class of jump-diffusion processes, which we now introduce. 
Let the forward process (p(t) : t > 01 be the jump-diffusion process on [0. f ] corresponding to the 
generator 

G<t>(p) = \v{p)H\ v ) + ApW(p) + Xw Q (p)(m - Hp)) + Aioi(p)(0(i) - Hp)), (6) 

for tfi £ C 2 ([0, 1]). In other words, the diffusive motion of p(-) is governed by the generator 

AHp) = Hp)H'{p) + Ap)H(p), 0GC 2 ([O,I]), (7) 

with infinitesimal drift and variance coefficients, /i(-) and v(-), respectively, while jumps occur 
at constant rate A > and move the process from state p £ [0, 1] either to with probability 
wo(p) £ [0, 1] or to I with probability w\{p) := 1 — w (p). Throughout this article, we will assume 
that the following conditions are satisfied. 
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Assumption 2.1. The infinitesimal mean and variance satisfy /Lt(0) > > jtz(l) andv(0) = v(l) = 
< v(p) for all p € (0, 1), respectively. Furthermore, v(-), fi(-) and Wq(-) are analytic functions in 
a neighborhood of [0, 1], and the infinitesimal variance has non-zero derivatives v (0) > > v (1) 
at the boundaries. 

For example, if A is the generator of a neutral Wright-Fisher diffusion (fTI) (with s(p) = 0), 



then Assumption 2.1 is satisfied with /i (0) = /xq > 0, = < 0, and v (0) = 1 = — v (1) 



We also remark that when Assumption 2.1 is satisfied, Lemma 3.1 shows that (p(t) : t > 0) has 
a unique stationary distribution Tr(p)dp with a density 7r(-) that satisfies a second order ordinary 
differential equation with non-local boundary conditions. 

In Theorem 2.1, we characterize the time- reversed process (p(t) : t > 0J of the forward process 
(p(i) : t > 0J . In keeping with the heuristic description given in the Introduction, (p(t) : t > 0J is 
also a jump-diffusion process on [0, 1] but now with jumps from the boundary {0, 1} to the interior 
(0, 1). The diffusive motion of this process is governed by the generator 

A<j>{p) = \v{pW'(j>) + KpW{p) where fi(p) := - M (p) + (8) 

and ip E C 2 ([0, 1]). Notice that this diffusion has the same infinitesimal variance as the forward 
diffusion, but has a different infinitesimal drift that depends on the jump events via the stationary 
density 7r(-). Also, the jump rates of the time-reversed process depend on a local time process 
which is described in the following way. Recall that the scale function and the speed measure 
associated with A are 

S(p):= f cxp(- f 2 ^-dz)dx and rh(dp) := 1 dp, p€[0,l], (9) 
J\ V J\ v ( z > ' v(p)S (p) 

respectively. The scale function will be identified with the associated measure S(dp) := S (p)dp 
on [0, 1] and the speed measure fhldp) will be identified with its density function. 

We define the local time process of the jump-diffusion p(-) such that it agrees with the local 
time process of the diffusive motion until the first jump. More formally, we will introduce a non- 
negative process (L p (t): t>0,p£ [0,1]) which is almost surely continuous in (t,p) and which 
satisfies 

<■* f 1 

f(p(u))du= f(p)L p (t)rh(dp) a.s. t>0 (10) 



for all measurable /: [0, 1] — > [0, oo). We remark that the local time process satisfying (10) differs 



from the semi-martingale local time of the diffusive motion of the time-reversed process by a scalar 



factor (see Eq. (77)), i.e., L is a weighted semi-martingale local time. That this process is well- 
defined is shown below in Lemma |6.1| The last ingredient needed in our construction is a pair of 
independent, exponentially-distributed random variables, Rq and R\, with parameters 

n := lim f e [0, oo] (11) 



p-s-i V ir(p) 

where ft, := Io w i(p) 7T (p)dp, i G {0, 1}- The existence of the limit displayed in ( [TT] ) is guaranteed 
by Lemma |4.3| By convention, Ri := if fj = oo and Ri :— oo if = 0. 

With these definitions, we now describe the dynamics of the time-reversed process (p(t) : t > 0) . 
Between jump times, (p(t) : t > 0J evolves according to the law of the diffusion governed by A. If 
this diffusion hits a boundary i £ {0, 1} at a time t > and if at that time the local time process 
exceeds the random variable Ri, that is, if Li(t) > Ri, then p(-) jumps from i to a random point 
chosen from (0,1) according to the distribution — J Wi(p)ir(p)dp. From this point, p(-) restarts 
independently of the sample path up to that time. 

To better understand how the dynamics of p(-) are influenced by the boundary behavior of the 
forward process, we take a closer look at the jump times. Because the coefficients u(-) and /x(-) are 



G 



smooth on an interval containing [0, 1], an application of Feller's boundary classification criteria 
shows that a boundary point i € {0, 1} is accessible to t he forward diffusive motion if and only 



if 2|/x(z)| < \v Then, in conjunction with Lemma 3.2 which describes the asymptotics of the 



density n(p) near the boundaries, Lemma 4.3 implies that 

/m(v) \ f e(0 ' oo) if2|/i(i)|<|«'(»)| andA«Ji(-)^0 
n := lim(— jyXKi) { = oo if 2|/i(i)| > \v and X Wi (-) ^ (12) 

P npj [=0 i{\wi(-) = 

for z £ {0, 1}. Thus, provided that XiVi(-) ^ 0, the time-reversed process immediately jumps into 
the interior (0, 1) if the boundary point is inaccessible to the forward diffusive motion, that is, 
if 2|/i(i)| > \v In this case, the state space of p(-) is in fact [0, 1] \ {i}. In contrast, if i is 
accessible to the forward diffusion and Xwi(-) > 0, then the exponential random variable Ri is 
almost surely positive and so a positive amount of local time will have to be accrued at i before 
a jump occurs off of this boundary point. 

Notice that, in either case, we expect that both boundary points are accessible to the backward 
diffusive motion. According to Lemma |4.1| 



m-{*?, ... ■ 'MM), (is) 

I v (i) -nil) if 2|/x(z)| > \v 

and again an application of Feller's boundary criteria shows that the boundary point i is acces- 
sible to the backward diffusive motion whenever 2/i(i) ^ v (i). The critical case is more subtle. 
Then, 2/i(i) — v (i), and so i would be inaccessible if the drift coefficient fi(-) were analytic in a 
neighborhood of i. However, we show in Lemma [Li] that 



and then Feller's criteria reveal that the logarithmic singularity is just sufficient to render the 
point i accessible to the backward diffusive motion when 2/2(i) = v (i). 

Our main result states that the process p(-) has the same law as the stationary time reversal 
of the jump-diffusion p(-). 



Theorem 2.1. Assume 2.1 Let p(-) be the jump-diffusion on [0,1] with generator G as defined 
in ^ . Then the process Jp(t) : t > 0) is a version of the stationary time reversal of {pit) : t > 0) , 
that is, 

(p(t) :t<T)~ (p{T -t):t<T) V T > (15) 
if the distribution of p(0) is the stationary distribution n(p)dp. 
The proof of Theorem |2.1| is deferred to Section [7] 



Theorem 2.1 establishes the time reversal of the stationary process over a fixed time interval 
[0,T], T < oo fixed and non-random. Readers being interested in other pathwise time reversals 
are referred to the literature. It has been shown that processes which arc in 'Hunt duality' (see 
[H Chapter VI]) are time reversals of each other. Reversing time at the end point of an excursion 
from an accessible boundary point results in the dual process being started at this boundary point, 
see [9[[2T]. The paper of Mitro [22] reverses time at inverse local time points. 

The remainder of the paper is organized as follows. The next section collects some results 
concerning the stationary distribution of the jump-diffusion process ([2]). Section [1] describes the 
boundary behavior of p(-). In particular we show that the time-reversed process jumps immediately 
off of any boundary that is inaccessible to the forward diffusion. In Section[5]we identify a core for 
the generator G satisfying the adjoint condition The local time process of p(-) is introduced 
and studied in Section [HJ Finally, Section [7] shows that p(-) has generator G. The proof of this 
result depends on an application of the Ito-Tanaka formula. 
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3 The stationary distribution 



The following lemma asserts that, if the conditions of Assumption |2. 1| are satisfied, then the jump- 
diffusion process p(-) has a unique stationary distribution on [0,1]. It is also shown that this 
distribution has a density ir(-) with respect to Lebesgue measure which satisfies a second-order 
ordinary differential equation (ODE) subject to boundary conditions that are non-local whenever 
A > 0. If A = 0, then this equation can be solved explicitly, leading to the familiar expression 

7r(p) = C- 1 ^- exp (2 J" fi(q)/v(q)dq) , (16) 

where C is a normalizing constant, e.g., see Section 4.5 in [8]. Although a general closed-form 
expression for 7r(-) apparently does not exist when A > 0, ir(-) can be calculated by numerically 



solving (17) using a modification of the shooting method [5S]. In addition, below we give an 
explicit formula for the stationary density in the important special case of a neutral Wright-Fisher 
diffusion subject to recurrent bottlenecks. 



Lemma 3.1. Assume 2.1 Then there exists a unique stationary distribution for the process 
(p(t): t > 0). This distribution is given by l(o,i) {p)^(p) dp where ir: (0, 1) — » (0,oo) is the unique 
solution of the non-local boundary value problem 

((i W r)"-(M7r)'-A7r)(p)=0 Vpe(0,l) 

lim (fnr — (hvir) )(p) = A« 

p— >o 

lim (/Z7r - (§wr) ) (p) = -Xki 

lim(i>7r)(p) = = lim(u7r)(p) 
1 

n(p)dp = 1, 

where Kj := u>i(p)Tr(p)dp for i € {0,1}. Furthermore p(t) converges in distribution to the 
stationary distribution as t —} 00 for every initial distribution of p(0). 

Proof. Existence and uniqueness of a stationary distribution ir(dp) follow from standard argu- 
ments, so we only give a sketch. Couple two versions of (p(t) : t > 0) with different initial distri- 
butions through the same jump times such that the diffusive motions in between jumps are inde- 
pendent until they first meet and arc identical thereafter. Due to the assumption /i(0) > > fi(l), 
the coupling is successful if there are no jumps, that is, if A = 0, see Theorem V.54.5 in [55]. In the 
presence of jumps (A > 0), the probability that both components jump to the same boundary is 
positive at every jump and, therefore, the two components agree eventually. As a consequence of 
this successful coupling and of compactness of [0, 1], p(t) converges in distribution to a probability 
measure n(dp) as t — ¥ 00 and Tv(dp) is an invariant distribution. 

Next we prove that ff(-) has a smooth density tt(-). Denote by (X(t)) t >o the diffusion governed 
by A (see ([l])). The scale function and the speed measure associated with A are 



S(p) := / exp(- / - dz]dx and m(p)dp := dp, p£ [0,1], (18) 

Ji V Ji v(z) J v(p)S (p) 

respectively. Existence and smoothness of the density 7r(-) will be derived from existence and 
uniqueness of the transition density Q(t;p,q) of (X(t)) t >o with respect to the speed measure. 
Existence of Q(t;p,q) is established in ltd and McKean (1974) [TT] ([2U] is more detailed in a 
special case) via an eigen-diffcrcntial expansion. To state this result more formally, we introduce 
the following notation. The interval defined in [TT] - here denoted by J* - is the unit interval closed 
at if is accessible, closed at 1 if 1 is accessible and open otherwise. For this, note that whenever 
(X(t)) t >o hits a boundary point, it immediately returns to the interior (0,1) because of the 



8 



assumption /z(0) > > Moreover note that the stopping time min{t > 0: X(t) $ I'} = oo 

is infinity almost surely. The generator of (X(t)) t > is defined in [TT] via right derivatives. As 
(X(t)) t >o is a regular diffusion, this generator coincides with 

m(p) dp\b (p) / 
for / e C 2 (I*). There exists a solution e(-f, •) = (ei(7, •), £2(7, •)) °f 

(Ae(7,-))(p) = 7e(7,p) V0<p<l 



K7,|) = (l,0) ^e' (7, |) =(0,1) 



(20) 



for every 7 G (— 00, 0] such that 7 n- e(7,p) is continuous for every p e I'. Based on these 
eigenfunctions, it is shown in [TT] that there exists a Borel measure 2(^7) from (—00, 0] to 2 x 2 
symmetric non-negative definite matrices 

^7)=fc£! s :itl) (2D 



such that 



s 2 i(g?7) ^22(^7) 



Q(t;p, q) = / e^e T (7,p)-s(d 7 )-e(7,p), (*,P, g) € (0, 00) X J* x /*, (22) 



is the transition density of (X(t)) t >o with respect to the speed measure m(-). Now as our jump 
diffusion p(-) could also jump to an inaccessible boundary, we need to extend p M> Q(t;p, q) onto 
[0, 1]. Note that if i g {0, 1} is inaccessible, then i is an entrance boundary due to the assumption 
(— l)V(z) > 0- As in Problem 3.6.3 in [11], one uses the Markov property to extend (X(t)) t >o to 
the state space [0, 1]. Thus we may assume Q(t;p,q) to be defined on (0,oo) x [0, 1] x I*. 

With these results on the transition density of (X(i)) t > , we now establish existence of a 
smooth density of 7f(-)- Define kq <E [0, 1], k\ := 1 — kq by 



Hi ■= I Wi(p)n 
Jo 



(dp) for i e {0, 1} (23) 



and observe that Ki is the probability that a stationary version of the process jumps to the 
boundary point i when it jumps. Recall that the jump times of p(-) form a Poisson process with 
rate A and that in between jumps, p(-) evolves according to A. If U is any Borel measurable set 
in [0, 1], then by conditioning on the time and distribution of the last jump, we have 



Tt(U) = K / Ae~ At / Q(t;0,q)m(q)dqdt + Kl / Ae~ At / Q(t;l,q)m(q)dqdt. 
Jo JU JO JU 

Interchanging integrals, we infer that tt(-) has a density with respect to Lebesgue measure and we 
set n(q)dq :— Tr(dq) where n: (0, 1) — > [0, 00) satisfies 



1 />oo />0 

j=0 ^° •^~ <x 

3=0 J -' x 



e 7 *e T (7,j)s(rf7)e(7,9) dt 



-00 A - 7 
Xm(q) (k o G a (0, q) + k x G x {1, qfj . 



A Z T { 1 , ] ) 5 (d 1 )z{ 1 ,q)dt (24) 
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2.1 



we conclude that the 



The function G\(p, q) is the Green's function and is C 2 in the second vari able for every p £ [0, 1]. 
As the speed density m(-) is also C 2 in (0, 1) due to Assumption 
stationary density tt(-) is twice continuously differentiable. 

The main step of the proof is to show that ir(-) satisfies (17 1. By Proposition 4.9.2 of Ethier 
and Kurtz [7j, the stationary distribution ir(p)dp satisfies 



G(f>(p)ir(p)dp = 



(25) 



for all <f> S C 2 ([0, 1]). Let < e < \. The functions v, n, tfi and ir are C 2 in [e, 1 — e]. Integration 
by parts yields 



l-£ 



G(f>-~K dp — </>(0) 



l-E 



\Wq-1T dp 



1-e 



(f) -{\vk) + (j) -(/i7r) — A07r dp 

'W e + [^(MT-(W)]=~ B - 



Aii>i-7r dp 



(/X7r) — (^W7r) + \tt dp. 



(26) 



By considering all functions <f> £ C 2 with support in (e, 1 — e) and then letting e — > 0, we conclude 
that ir(-) satisfies the second-order ODE in (17 1. Furthermore, because the functions G(f>, wo, W\ 
are bounded and n is integrable, we may apply the dominated convergence theorem to theintegrals 
on the left-hand side of (26) as e — > 0. Together with (25) this shows that 



As 



Hm[0' lwr]i- e + </>(l)-lim(/i7r- (§wr)') (1 - e) - ^(0)dim(^7r- (|wr)')(e) 
= -0(1)Aki - <?(>(0)Ak o . 
was arbitrary this implies the non-local boundary conditions in (|17|). 



(27) 



If 7r(-) is another normalized solution of (17), then reversing the previous arguments shows 
that (25) holds with tt replaced by tt(-). This in turn implies that n(p)dp is another stationary 
distribution and we conclude that tt = ~k. It remains to show that 7r(-) is strictly positive. Assuming 
7r(p) =0 for some p £ (0, 1) , we conclude that tx (p) = from p being necessarily a global minimum. 
However, the only solution of the second-order ODE in ( 17) satisfying ?r(p) = = n (p) is the zero 
function, which contradicts the assumption that 7r(-) is a probability density. □ □ 

Remark 3.1. Lemma 3.1 can be used to find an explicit formula for tt(-) when the jump- diffusion 
process is a model of a neutrally-evolving population subject to recurrent bottlenecks, i.e., when 
p(-) has generator 

gHp) - \ P (i - pW'{p) + (Ml -p)- (p) + A (^(!) + (i - p)<K°) - 0(p)) • 

In this case, (17) is a hypergeometric equation and, using the fact that the mean frequency of allele 
A\ in a stationary population is /iq/(ho + /ii), we find that the density 7r(p) is equal to 

n(p) = C-yw-^l - p) 2 ^- 1 [fi F(l - a, 1 - 6, 2/x ,p) + inF(l - a, 1 - b, 2^, 1 - p) 

where C is a normalizing constant, F(a,b,c;z) is Gauss' hypergeometric function, and the con- 
stants a and b are determined (up to interchange) by the equations a + b = 3 — 2(p a + fj,i) and 
ab = 2(A + 1 — fio — Ml)- 

The second lemma of this section provides information on the boundary behavior of the density 
of the stationary distribution. This information is derived using results on second-order ODEs 
with regular singular points. 

We adopt the Landau big-0 and little-o notation. In addition, for two functions ^i(-) and 
V>2(0> we wr ite ijj\(p) ~ ip2(p) as p — > i if both ipi(p) = 0(^2 (p)) and tp2(p) — 0(ipi (p)) as p — > i. 
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Lemma 3.2. At 



2.1 Let 7r(-) be the density of the stationary distribution of the jump- 



diffusion p(-) corresponding to the generator ([6]). Then, for i e {0, 1}, 7r(-) is equal to 



7T(p) = < 



Ci\p-i\ ?w +0(1) 
r^ln(^) +0(1) 

\v (t)| V|p-*|/ v ' 



i/2| M (i)| < |t/(i)| 
if 2|*(i) = 



|2^(0-^ («)l + ) ^2|/i(<)| >|«'(<) | and Atu<#0 

{ d\p-i\ «'«> +0(|p-i|»'(o) 



(28) 



i/2|/i(i)| > |« and AtOj = 

as p ^ i where Ci € (0, oo). in addition if2/j,(i) = v (i) and Xwi = 0, i/ien ir(i) > 0. 

Proof. We only consider i = as the case i = 1 is analogous. We begin by observing that i = 
is a regular singular point for the differential equation in (17 1, see e.g. Section 9.6 in [3] for this 
concept. The associated indicial equation for is 



v (0) 



(29) 



and has roots a := and /3 := 2m( ° ) ,~" (0) ■ Note that /3 > -1. If a - /? £ Z, then Theorem IX.7 

in [5] tells us that ir(-) is equal to a linear combination of b\(p) := p Q (l + /ii(p)) and 62(f) := 
p^(l + h 2 (j>)) in a neighborhood of where and /12 are suitable analytic functions satisfying 
h 1 (0) = = h 2 (0). 

If a — f3 £ Wi, then Theorem IX. 8 in [3 shows that tt(-) is equal to a linear combination 
of bi(p), b 2 {p) and \n(p)bi(p) in a neighborhood of 0. If 2fi(0)/v (0) € N> 2 , then assuming 
7f(p) = —c\ ln(p) + O(l), 7r (p) = —C\~ + O(l) for some constant c\ > leads to the contradiction 



00 > A«o = (m(0) — 5 W (0)) hm 7r(p) — hv (0) limp7r (p) = 00 

p— >o p— >o 



(30) 



where we have used (17 1. Therefore ln(p) does not contribute to 7r(-) if 2/i(0) > w (0). 

It remains to calculate the coefficients. In the case 2/x(0) ^ v (0), insert 7r(6) = cibi(p)+c 2 b 2 (p) 
into (17) to obtain the coefficient c\ 



Xkq = lim 

p^0 

= lim 

p-s-0 



(M(P) ~ \v'{p))tt{p) - ^v(p)Tr'(p) 
( M (0) - V(0))( Cl + c 2 /) - ^'(QjpcaV 1 



(31) 



(m(o)-^'(o)) Ci . 



Of course if Ako = 0, then ir(-) ^ implies c 2 > 0. Next we show that Ako > together with 
2^(0) < v' (0) implies c 2 > 0. 



Assuming c 2 = implies that 7r(0) 



2Ak 



2 p.(0)-v (0) 



< which contradicts n(-) being a density 



function. In the critical case 2/i(0) = v (0), (17) implies that 

Xk = --v (0) lim (pn {p)). 
A p-s-0 



(32) 



Therefore the coefficient of — ln(p) is ^TqT- If 2/^(0) = v (0) and Ak = 0, then assuming 7r(0) = 
implies ?r(p) = cp n + 0(p' i+1 ) with c + 1 and n > 1. Inserting this into the ODE in (17) leads to 

= \v (0)c( P n+1 )" - M (0)c(p n )' + 0(p") 



(n+l)n „_-, v (0) „_-, „ % 

2 ( )CP ~ n 2 P + ^ 



(33) 
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as p 
1. 



0. Dividing by nv (0)cp n 1 /2 and letting p — > results in the contradiction n + 1 = 



□ 



□ 



4 Boundary behavior of the time- reversed process 

We begin this section by characterizing the boundary behavior of the infinitesimal drift coefficient 
of the time-reversed process. This information is of interest for two reasons. First, it will be 
used to establish that any boundary point that is accessible to the forwards-in-time process, 
either diffusively or via jumps, is accessible to the diffusive motion of the time-reversed process. 
Secondly, we also expect the time-reversed process to have the same state space, [0,1], as the 
forward process. Indeed, if a boundary point i is inaccessible to the forward diffusive motion, then 
subsequent results will show that the time-reversed process jumps back into the interior as soon 
as it hits a boundary. If, however, i is accessible to the forward diffusive motion, then because 
the time-reversed process may visit i without jumping, we need to confirm that p(-) does not then 
wander outside of [0,1]. To this end, we will show that fl(0) > whenever is accessible and 
similarly that //(l) < whenever 1 is accessible. 



Lemma 4.1. Assume 
in Q satisfies 

I « (i) - fi(i) 
as p — > i for i g {0, 1}. 



2.1 Then the drift function fl(-) of the backward diffusive motion defined 



+0{\p-i\) 



if2\fj,(i)\ < \v (i)\ or Xm = 
_ ,. if 2p,(i) = v'(i) and Xn { ^ (34) 

+0(\p-i\) if2\n(i)\ > \v'{i)\ and Xm ^0 



V In \ p- 



Proof. Recall that lemma 3.2 describes the asymptotic behavior of 7r(-) as p — > i. From this we 
obtain 



v(p)ir\p) 

7r(p) 



„'C,-1 2 m(0— " (») 

U \ l > v'(i) 

v (») 



-0(\p-i\) 



In |p- 




-Ol 



An\p-i\/ 

+0(\p-i\) 



if 2\fi(i)\ < \v or Xk. l = 

if 2/i(i) = v (z) and Ak^ 
if 2\fj,(i)\ > \v'(i)\ and Ak, ^ 



(35) 



as p — > i for i e {0, 1}. Inserting this into 



Hp) ~ ~Kp) + 



(vn) (p) 
ir(p) 



= - f i(i)+v'(i) + 0(\p-i\) + 



v(p)ir'(p) 

7r(p) 



results in assertion (34). 



□ 



(36) 



□ 



Remark 4.1. Notice that fl(0) < if fi(0) > ^'(0), so that the diffusive motion of the time-reversed 
process need not be confined to [0, 1] . Nonetheless, because the boundary p = is inaccessible to 
the forward diffusion in this case (i.e., 2/^(0) > 7/(0)^, the fact that the process p(-) immediately 
jumps back into (0, 1) upon hitting will ensure that the jump -diffusion is confined to [0, 1]. 

We next show that if a boundary point is accessible to the forward jump-diffusion p(-), either 
diffusively or via a jump, then it must be accessible to the backward diffusive motion governed by 
A. Recall the scale function S from (J9j) . 



Lemma 4.2. As 



2.1 



The boundary point i £ {0, 1} is accessible to the diffusive motion 
governed by A, that is S(i) € 1R ; if and only if i is accessible to the forward jump-diffusion p(-), 
that is, if Xwi ^ or \ft(i)\ < \v 
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Proof. W.l.o.g. we only prove the case i = 0. According to Lemma 15.6.1 in [14], the boundary 
point is accessible if and only if 5(0+) is finite. (This is a special case of Feller's boundary 
classification criteria.) Substituting the asymptotic expression for fx near p = (see Lemma 4.1 1 
into the definition of 5, we obtain in the case Xkq > 



5 (p) := exp 



dz 







p 




1 


1 


V 


(ln(p)) 2 




2„'(0)-2 


p 


"'(0) 



if 2|/x(z)l < \v'(i)\ 
if 2/x(i) = v\i) 

if 2^(1)1 > 



(37) 



as p — >• 0. In all three cases, 5 
arguments. 



is integrable over (0, |]. The case Akq = follows from similar 



□ 



□ 



The following lemma shows that the rate constant (defined in (11)) is equal to infinity if 
the boundary point i € {0, 1} is inaccessible to the forward diffusive motion. Therefore p(-) jumps 
whenever it hits i as Li(t) > = Ri. In addition, if there are no jumps in the forward process, 
then j-j = 0, and p{-) never jumps as Li(t) < oo = Ri. 



Lemma 4.3. Assume 2.1 Then the rate constant ri used to define the jump times ofp(-) satisfies 




if2\n(i)\ < \v {i)\ and AiOi(-) # 
£/2|/i(i)| > \v (i)\ and Xwi(-) 
if\Wi(-)=0 



(38) 



for ie {0,1}. 



Proof. W.l.o.g. we assume i = as the case i = 1 is similar. If Ako = 0, then ro = is trivially 
correct. Assume Xkq > for the rest of the proof. The asymptotic behavior of the scale density 
5 (•) is given in (37). From this we derive the asymptotic behavior of the speed density rh(p) 
(defined in J9J) as p — > 



rh(p) = 



v{p)S'{p) 



2n(0)-n (0) 
p „>) 

(Hp)) 2 



p 



if 2/i(Q) < v'(0) 
if 2^(0) = u'(0) 

if 2^(0) > v'(0). 



(39) 



Compare (39) with the boundary behavior of 7r(-) (see Lemma 3.2) to obtain (38). 



□ 



□ 



5 The generator of the time-reversed process 

In this section we identify the generator of the time-reversed process and show that this operator 
satisfies the duality condition given in (J2|. That this operator is also the generator of the jump- 
diffusion process p( ) described in Section [2] will be established in the final two sections of the 
paper. 

The following notation will be needed to formulate the generator of the time-reversed process. 
If v(dp) — f(p)dp is a measure on [0, 1] with continuous density /: (0, 1) — > (0, oo) with respect to 
Lebesgue measure, then we write 

V / y^P f(q)dq 

whenever this limit exists in R and denote by 

35 (A,) := lip £ C([0, 1]) : D v if)(jp) exists Vp £ (0, 1), D v ijj is continuous 

L , (41) 

in (0, 1), D v ij)(Q+) and D v ip(l-) exist in R| 
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the subset of functions which are mapped to continuous functions on [0,1]. Note that ip G 
C x ((0, 1)) and D„ip(p) = p G (0,1), for every ip G D(D U ). For ip G £>(£>„) the defini- 

tion of D u extends to the boundary via D v ip(<S) := D v tp(0+) and D v ip(l) := D u ip(l—). In this 
notation, the generator A of the backward diffusive motion reads as 

(Aip)(p) = §«G#"(p) +£(p)v'(p) = |«(p)^(p)(i/ + i— V»')(p) = D * D s1>b) 

for every ^ G C 2 ([0, 1]). The following set will be a core for the generator of the time-reversed 
process 

:= i ip G ©(-Ds) : € 33 (Ah) and for i G {0, 1} 

/* I (42) 

lim^W )(p) = (-1) 1+1 / (</>(p) - ^(i))Ati;i(p)7r(p)dp 



H 



J)— K 



The following lemma asserts that the restriction of Dj- n D s to H extends to a strong generator of 
a Markov process. Indeed, this can be deduced from Theorem II. 4 of Mandl (1968) which shows 



that the restriction of D,nD s to 



ip G ^(D§) '■ D s^ e ®(An) and if i G {0, 1} is accessible: 

(43) 



1 m-Hp) 

o \Pi{i) - Pi(p)\' 



is the strong generator of a Feller semigroup if Xi and rji are both non-negative, if qi is a non- 
decreasing function on [0, 1], and if pi is continuous, non-decreasing and equal to S in a neigh- 
borhood of i, i = 0, 1. Only the case = xo = Xi = Vo = Vi = <z(l — ) — 7^ <Z(1) — 9(0) is 



excluded. The quotient in (43) is to be interpreted as (— l) l+l D s ip{i) for p = i G {0, 1} and the 



integral with respect to dq^p) denotes the Lebesgue-Stiltjes integral with respect to 



Lemma 5.1. Assume 2.1 The restriction of D^Dg to the set H extends to a strong generator 
of a Markov process. 



Proof. By Theorem II. 4 in [19] , it suffices to prove that H is of the form ( 43 1 . According to 
Lemma 4.2 the boundary point i G {0, 1} is accessible to the diffusion governed by D^Dg if and 
only if Xwi(-) ^ or \(J>(i)\ < \v First we show that the condition in (42) is trivial if i is 
inaccessible, that is, we show lim p _>.i (v7rip )(p) = for every ip G T>(D S ). Suppose that 

C := lim (vnip') Op) > (44) 



for some function ip G 33(-Dg). By Lemma 3.2 nOp) is bounded above by — Cln(|p — i\) in a 



neighborhood of i for some C > 0. Thus, in a neighborhood of i, 

> — C C 

^ (P) - 2v(p)tt{ P ) ~ 4C\v'(i)\\p~i\\n(\p-i\)' ^ 

Integrating over [\,p] implies that ip{p) is bounded below by 4g ^ ( ..^ In (— ln(|p — i\)) as p — » i 

which contradicts ^ G C(J). An analogous argument applies to the case C < 0. 

Now, let i be accessible to the forward process, that is, A«j > or \/i{i)\ < \v (i)\. Note that 
S(-) is a bounded continuous non-decreasing function in a neighborhood of i. Choose XiiVi = 
and pi Op) := SOp). Furthermore, let 

nip) : = / \Pi(-i)-Pi(y)\m(y)^(y)dy + c i { o,i](p)ii=o + c 1 i 1 (p)i i=1 (46) 
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where cq, c\ 6 [0, oo) are to be chosen later. Note that q{ is bounded and that dqi puts mass Cj on 
the point i. With these definitions, the condition in (43) takes the form 



(47) 



O00 - ip(p))m(p)^{p) dp = (-iyD § ip(i)c 

It remains to choose Ci € [0, oo) such that 

Dgip(i)ci = \im(^virip )(p) 



p— f 2 



(48) 



for every ^6H. Using the boundary behavior (37) of S(-) and the asymptotic behavior (28) of 
7r(-), we arrive at 



(vnip )(p) ~ |p-i|-7r(p)-5 (p)-D§ijj(p) 



if2| M (*)|<h,0O| 
l if 2 M (i) =*'(<) 

In [\p-i\) 

D-Mp^p-ififr 1 if2|/*(t)|>|«'(t)| 



D^(p) : 



as p — >• i. This shows that (48 1 holds with some constant Cj S [0, oo). 
Lemma 5.2. Assume 



□ 



□ 



Lei i/ie process (p(t) : i > 0) fee in equilibrium. Then the time-reversed 
process (pt: t > 0) exists, that is, there exists a process (pt'. t > 0) satisfying 

(p(t) :t<T) = (p(T -t):t<T) V T > 0. (49) 

In addition, E is a core /or i/ie generator G of (pt: t > 0) and 

GV> = + A^' = D^Dgij: V V> S H. (50) 

G is the strong 



5.1 



Proof. Let G be the closure of the operator defined in (50). By Lemma 
generator of a Markov process (pt : t > 0). Recall the generator G of p(-) from (6]). We will prove 
that G is the adjoint operator of G with respect to the invariant measure ■n(p)dp. Let < e < \, 
4> G C 2 ([0, 1]) and i/)€H. The functions v, fi, (f>, ip and tt are C 2 in [e, 1 — e] . Integration by parts 
yields 



1-e 



G<j>-ip-ir dp - <t>(Q) 



l-s 



1-e 



\wq-%1)-1T dp — (f)(l 

4> -(^vipir) + <j) ■(fxtpTr) — Xcjitpir dp 



\wi-ip-ir dp 



(51) 



+ 



(■kvipir) — (A*'0 7r ) — A"07r 



dp. 



As 7r satisfies (17), we see that 



(^vi/jit) — (fJ/ipn) — \ipir = \vijj tt + (- — fxjip tt — Df n Dgip-ir. 



(52) 



The functions G(f>, ip, Wq, W\, <f> and D^Dgil) are bounded and tt is integrable. Hence, we may 
apply the dominated convergence theorem to the integrals in (51 1 as e — > 0. This also proves that 
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the limits of the boundary terms in (51 1 exist as e — > 0. Thus letting e — > in (51 1, we obtain 



Gcjy-ipir dp 
o Jo 



/ cfi-D^Dgifj-Trdp 
Jo 



= 4>{1)(^(\)(^it - (§ott)')(1-) - lim (\vKij))(p) + J Xwyip-n dp^j 

- 0(0) (V(0) {lilt- (|ott)')(0+) - lim (±wr^')(p) - [ \w Q -^-irdp 
\ " P^ Jo 



(53) 



= 

for all (f) S C 2 and -0 S H. The last equality follows from ( 17 1 and from € H. This proves that G 



and G are adjoint to each other. Consequently, the semigroups of (pit) : t > Oj and of (pt : t > 0) 
are adjoint to each other. According to [53] > this implies that the Markov process (p(t) : t > Oj 
associated with G has the same law as the time-reversed process of (p{t) : t > 0) . □ □ 



6 The local time process 

This section describes some properties of the local time process of p(-). First we show existence. 
Recall the scale function S and the speed measure rh from Q. 



Lemma 6.1. Assume 2.1 Then there exists a unique, non-negative process 

(Lp(t):*>0,pG[0,l]) (54) 
which is almost surely continuous in (t,p) and which satisfies 

f(p{u))du= / f(p)L p (t)m{dp) \ft>0 (55) 



for all measurable f: [0, 1] — > [0, oo) almost surely. In addition, if 2\/j,(i)\ > \v (i)\ for i € {0, 1} ; 
then Li(-) = almost surely. 

Proof. Let =: tq < t\ < < • • • be the jump times of p(-). Then, by construction of 
p(-), (p(t + T n -i): 0<t<T n — T n -x), n 6 N>i, are independent diffusions governed by A. It 
is well-known that p{- + r n _i) can be written in terms of a Brownian motion as follows. Let 
{B yn : j£R,ne N} be a family of independent standard Brownian motions with B y ' n = y. 
Denote by (L^ ' (*) '■ t > 0,x <E R) the local time process of B y ' n , see e.g. Section 2.8 in [IT], and 
define 

d n) := t Lt^-^ n {t)fh{dp) t>0. (56) 



S(p) 

Then a version of p(- + r n —i) is given by 



P(t + T n -l)=S- L S/ f ;,-i V ' 0<t<T„-T„_i. (57) 



Inserting this into the occupation time formula of the Brownian motion, a short calculation (see 
e.g. Section 5.4 in [TT]) shows that 

f(p(r + T n ^))dr= f f{p)L p n \t)m[dp) 0<t<r„-r n _ 1 , (58) 



lo Jo 
where the local time process of p(- + r„_i) with respect to the speed measure is 

L^t) := Lf^ n - l)),n (i^)- 1 ) 0<t<T„-r»_ 1 ,|>6[0,l],ne%. (59) 
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Now we put the independent path segments together by defining 



L p (t) :=L^(t 



T n -1 



71-1 

E 

k=l 



Tk — Tfc-1 



if T n _l < t < T„ 



(60) 



It is easy to use (58l to show that (L p (t) : t > 0,p £ [0, 1]) satisfies (55|. Uniqueness follows from 
standard arguments. 

If 2|/i(i)| > \v (i)\, then p(-) jumps into (0, 1) as soon as it hits the boundary and we conclude 
that p(t) 7^ i for all t > 0. Thus the local time at this boundary point is identically zero. □ □ 

In the next section, we will need to be able control the second moment of the local time of the 
time-reversed jump-diffusion at a boundary point. We first prove the following estimate concerning 
the local time of a standard Brownian motion. 

Lemma 6.2. Let e, 8 > and let (B t : t> 0) be a standard Brownian motion with local time (•) 
at x £ IR. Suppose that the function S: [0,5] — > 1R is non- decreasing. Define A :— S(8) — S(0) 
and 



Ct ■= 



1 



Ll v) (t)dy t>0. 







Then, for each m > 0, there exists a constant C m independent of e and of 5 such that 

et 



for all p E [0, 6] and t>0. 



(61) 



(62) 



Proof. Inequality (62) is trivial if A — or t = 0, so we may and will assume that A > 0, t > 0. 
Fix p £ [0,6]. The left-hand side of (62) does not depend on the value of S(p), so we will also 
assume w.l.o.g. that S(p) = 0. Denote by B\ :— m&x{B s : s < t} and by \B\* t := max{|£? s | : s < t} 
the process of the maximum and the process of the absolute maximum, respectively. Define 
Z t := S~ 1 (B c -i) for t > 0. According to Section 5.4 in [11], L$(t) := L§ is the local time of 

(Zt ■ t > 0) at p. The process (Z t : t > 0) is equal in distribution to the process (5 _1 (i? et ) : t > 0) 
reflected at and at 8. Another way to construct (S(Z t ) : t > 0) is to take the path of {B Et : t > 0J 
and to identify each x £ [S(0), 3(6)] with the set {x + 2Az, 2S(8)-x + 2Az: z £ %}. Thus the local 
time (t) of (Z t : t > 0) inp is equal in distribution to the sum of L^ Az (et) +^^ z+2 5((5) ( £ ^) over 
z £7L. Note that (et) = almost surely on the event {|-B|* t < i£E. In addition, note 
that convexity of < x n- x n 
kN. Therefore, 



implies fc m - 1 (ai + • • ■ + a k ) m < (of H h af) for ai, . . . , a k > 0, 



E'' 



1[ 



/ -[fc,fe+2A) 
fee2AN> 



si J 



< E 



E E 

z ='^k a:6{0,2S(<5)} 



-^2Az+x( £ ^) 



E %wisi: t )-(!+2) 



m— 1 2 A 



L fcG2AN >0 



E 

z =~^k ^6{0,2S((5)} 



E 



(63) 



Use the strong Markov property (e.g. Proposition 2.6.17 in [T^]) to restart the Brownian motion 



at the first hitting time of 2Az and of 2Az 
is bounded above by 



2S(8), respectively. Thus the left-hand side of (63) 



]r p°(\B\: t £[k,k 

feG2AN>n 



< E 



rn — 1 2 A 

2) 2 £ EP[(Lf(et) 



(64) 
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Note that 2Ljf (i) and B\ are equal in distribution, see e.g. Theorem 3.6.17 in [13]. Therefore the 
left-hand side of ( 63 ) is bounded above by 



< 



E' 



Km 



1 



E' 



(65) 



1 



Km et 



where K m / 2 > 1 is a suitable constant which is independent of A, £ and t. The last step follows 
from the Burkholder-Davis-Gundy inequality, see e.g. Theorem 3.3.28 in [13]. Therefore (62 ) holds 
with C m := Kl. 



□ 



□ 



In the proof of Theorem 2.1 we will need to exploit the fact that, in the L sense, the local 
time (Li(t) : t > 0) at a boundary point i £ {0, 1} of the backwards process started at i decreases 
to zero faster than \fi as t — > 0. This might be surprising as one can show that 



E 



{L%(t))' 



as t — > 0. 



(66) 



However, the infinitesimal variance «(•) is zero in i. Thus, informally speaking, the diffusion 
governed by A is pushed away from zero almost deterministically at rate p,(i) > if the boundary 
point is accessible at all. 



Lemma 6.3. Assume 2.1 Then the local time at the boundary satisfies 



lim -E* 

t-yO t 



(ii(*)) S 



(67) 



forie {0,1}. 



Proof. If 2|/i(i)| > \v then Lemma 6.1 tells us that Li(t) = 0, which implies the assertion in 
this case. For the rest of the proof assume that 2|/i(z)| < \v W.l.o.g. we assume that % = as 
the case i = 1 is similar. To begin, we prove that ([67]) holds with L (-) replaced by Lq 1 ^-)- For 
S e (0, 1) define 

fn(6) = inf m(x). 

x<5 



The asymptotic behavior (39) of rh(-) implies lim^_»n m(5) = limp^o fh(p) = oo. Recall that B^ 1 

y. Observe that 



is a standard Brownian motion started at Bq' 1 



& ] ■= l~Lf^'\t)m(p)dp>m(6) 



Li 



S(0),1 



(y) ■ (t) dp=:( t Vt> 0. 



(69) 



Using ) < ( 1 we obtain an upper bound for L^O) as follows 



-E ,! 
t 

<1e 
- t 



4"w) J = jE 

rh(8) 



C 2 



< 



rh(S) 



m(S)(S(S)-S(0)Y 



-> 



(70) 



for some constant C 2 which is independent of t and 6. The last inequality is Lemma |6.2| 

Now we come to the local time process L (-). Recall r , R from Section [2] and let t\ be the 
first jump time of p(-) from the boundary point 0. The local time Lo(t) converges to zero almost 
surely as t — > 0. By the theorem of dominated convergence, this implies E°L (t) — > as t —> 0. 
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Thus there exists a, t > such that 7oE (Lq (i)) < 1/4 for all t < t a . Then we obtain from the 
definition (60) of L (-) and from the Markov property 



E l 



(Lo(t) 



L T 1 <t 



< 2E° 

< 2E° 

< 2E° 



(4 1} (t)) 2 ] +2E°[(Z (i)-4 1) (i)) 

(Lo(t)y 



4 1} (t) 



2e £ (p( t i)) 



1 L< 1) (t)>_Ro 



1 - e - r ° L ° ){t) 



■2W 



(ioCt)) 1 



for all t < to- Using this estimate we get for t < to 



E 



(^o(t)) ; 



Therefore 



lim-E° 



<E° 
< 3E° 



+ E° 
1 



(%,(*)) 
(%)(*)) 



2" 



< 6 lim-E° 



where the last equality is (70). 



□ 



(71) 

(72) 
□ 



7 The backward process 



In Section[5j we identified the generator G of the time-reversed process (see Lemma 5.2 ) However 



while it is clear that the boundary behavior of this process must be prescribed by the domain of 
the generator, it is difficult to see how a qualitative description of the process can be deduced 
from the analytical condition that defines this domain. To address this issue, we show in this 
section that the process p(-) defined in Section [2] also has generator G. This confirms the heuristic 
arguments given in the introduction and shows that the time-reversed process is a jump-diffusion 
process whose jump times depend on the local time process constructed in the preceding section. 

The proof of Theorem |2.1| is based on the Ito-Tanaka formula for semimartingales which involves 
the semimartingale local time process. Because this local time differs by a scalar factor from the 



local time process introduced in Section 6 (sec Eq. ( 77 ) below), we have restated the semimartingale 
Ito-Tanaka formula in terms of L p (-). This is done in the following lemma. 

Let (Y(t) : t > 0J be a diffusion corresponding to the generator A 



2.1 



Lemma 7.1. Assu 
defined in (8). Then for each £ H 



il>(Y(t))-i/)(Y(0))= [ D fh D^{Y{u))du+ [ ip'(Y u )Jv(Y u )dB u 

Jo Jo (73) 

+ ^Lo{t)-D- s i>{0)-H l {t)-D § i,{l) 

for all t > almost surely. 

Proof. Fix ip € H. We approximate tp with suitable functions and apply the semimartingale Ito- 
Tanaka formula. Denote by (Z p (f)) t>0 ^ the semimartingale local time process of (Y(t)) t>0 . 

We remark that, in general, this local time process is distinct from the local time, L p (-), introduced 
in the preceding section. By Theorem 3.7.1 in [T3] we may and we will assume that L p (t) is 
continuous in t and cadlag in p. The occupation time formula (Theorem 3.7.1 in |13j ) states that 

i>(Y(u))v(Y(u)) du = 2 [ i/j(p)L p (t) dp, t > 0, (74) 
Jo 
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almost surely. Let / be a continuous function which is C 2 except in {ai,...,a„} C [0,1] and 
which admits finite limits / (<Xfc+) and / (a k — ), k — 1, . . . ,n. Then the Ito-Tanaka formula for 
continuous semimartingales (see Theorem 3.7.1 and Problem 3.6.24 in |T3j) states that 

f(Yt) - f(Y Q ) 



f (Y u )[i(Y u )du + J f (Y u )\v(Y u )du + / / {Y u )^v{Y u )dB u 

n 

J2L ak (t)\f'(a k +)-f'(a k -) 



(75) 



k=l 



almost surely. 

For every n S N, let tp n be a continuous function which is equal to tp in (A, 1 — —) and which 
is constant both in [0, A] and in [1 — A, 1]. In addition, suppose that (VOneN approximates tp 
uniformly in [0, 1] and that (DmD gtp n ) ne ^ approximates D^Dgtp pointwise and boundedly in 
(0, 1). Note that 

D s Vn(--)=0 and D s i> n (-+)=D S il;(-). (76) 



Comparing the occupation time formula (74) of L p (-) with the occupation time formula (10) of 
Lp(-) we see that 



L p (t) = L p {t)\v{p)m(p) = L p (t) } 

1 2S (p) 



Vpe (0,1). 



(77) 



Now applying the Ito-Tanaka formula (75) to tp n (-) and inserting (77), we arrive at 



MYt)~MYo)= / D A DMY u )du+ / xP n (Y u )Jv(Y u )dB. 



+ \i L {t)D^ n (h - H^^D-^l - I). 

Zj I L i£l lb 



(78) 



Note that the Lebesgue measure of {u < t: Y u G {0, 1}} is equal to zero almost surely. Letting 
n — > oo in ( 78 ) completes the proof. □ □ 



Proof of Theorem 2.1. Recall G, D^Dg and H from Section [Hj Lemma 5.2 shows that the gen 



erator of the time-reversed process is the closure of G. Therefore it remains to be shown that the 
generator of the Markov process (p{t) : t > 0) restricted to the set H coincides with G, that is, 
that 



7 > Glp(p) = DrnDgtpip) 



(79) 



holds for all p <E [0, 1] and every ip € H. 

Recall Ri, ri, ki for i 6 {0, 1} from Section[2] Fix ip £ H and note that ip is C 2 in (0, 1). Using 
Ito's formula, it is straightforward to show that the convergence in (79) holds for every p £ (0, 1) 
if A«i ^ and holds for every p € [0, 1] if A«i = 0. It remains to prove (79) for i e {0, 1} if 
\ni ^ 0. Starting at i £ {0, 1}, (p(t) : t > 0) evolves according to a diffusion \ Y(t) : t > 0) which 
is governed by A until the first time t such that Li(t) > R{. At that time, the process restarts 
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from an independent random point Ji in (0, 1) with distribution — Wi(p)Tr(p) dp. Thus 



L ti(*)<i?, 



E^^(pi(t-0)-VW]^e" 



Pi/ 



V0 K i 



r ?; e 



-rd 



dl 



(80) 



E^l-e-^W] / [tp{z)-^{i)}-w l (z)TT{z)dz + 0(t-^L l (t)) 



o(t) + E l [»•<£,(*)] (-1) 4+1 lim (p) W- + 0(t-E%(*)) 



p— 



as t —> 0. In the last step we used the inequality 1 — e x — x < x 2 ioi x > together with 



Lemma 



6.3 



and The local time converges to zero a.s. as t — > 0. By the dominated 



convergence theorem, this implies that E l Li(t) converges to zero as t — >• 0. Thus the last summand 



(81) 



on the right-hand side of ( |80| is of order o(t). Furthermore Lemma 4.3 implies 
j-j lim (vinp )(p) = lim (vmip ){p) — DstpU). 



p— ¥i 



p—^i 



Next we consider the second expectation on the left-hand side of (80 1. Using Holder's inequality 
we see that 



E' 



L Li(t)>R 



(i>(Y(t))-i>(i)) 



E' 



(1 



)(MY{t))-iKi)) 



< WE 



{U{t)Y 



E* 



(82) 



where we have applied Lemma 6.3 Thus we obtain from the Ito-Tanaka formula (73 1 



E' 



o^+tDrhD^ii) + (-l)^E* -D^W 



(83) 



as i -> 0. Putting (|80[), (81 1 and (83) together completes the proof of Theorem 2.1 



□ 



□ 
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